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Abstract: In this paper we present an interdisciplinary research project which aims at developing 
an electronic vocabulary acquisition system for the German and the Italian language (ELDIT). The 
basic dictionary will be enlarged with text and exercise units in order to support a systematic 
approach for studying and practicing the vocabulary. To ensure maximum effectiveness of the 
learning process, modem psycholinguistic methods are applied alongside with new media and 
technologies including adaptive hypermedia. The system is implemented as an adaptive hyperbook 
and runs on the WWW. 



1. Introduction 

Recent research in the field of computer assisted language learning (CALL) shows that multimedia and 
hypermedia teachware seems to be effective and motivating for language learning (Egert 00). Such technologies 
open new doors for the presentation, maintenance, and dissemination of complex knowledge and information units. 
However, students who are less experienced in the use of information technologies may get into trouble and lose 
control over the system, especially if they are expected to work independently of a teacher or a tutor. Adaptive 
systems cope with this problem by maintaining a model of each individual user, which helps the system to prepare 
individually designed pages and to guide the users according to their personal learning goals. 

Until now, such systems have mainly been developed for teaching natural science and computer science 
(Nejdl et al. 99, De Bra et al. 98). We are currently investigating the use of adaptive systems in the domain of 
foreign language learning. In the ongoing research project ELDIT we are developing an electronic vocabulary 
acquisition tool for the German and the Italian language. Modem linguistic and didactic issues are considered in the 
design of the system. It will further contain a model of each user, adapt the content to the individual needs and 
preferences of each user, and guide the user through a systematic but individually shaped vocabulary acquisition 
process. 

The paper is structured as follows: In section 2 the concept of learners’ dictionaries is introduced. In section 
3 the ELDIT dictionary is described in detail. Section 4 provides an overview of how we intend to enlarge the basic 
dictionary to an adaptive vocabulary acquisition system. In section 5 we discuss related work. 



2. Learners’ Dictionaries 
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Vocabulary acquisition is an important part of foreign language learning. To support this difficult task, 
lexicographers designed a special kind of dictionary, so-called learners 1 dictionaries. A learners’ dictionary serves 
both as a reference book to decode what the learner does not understand and as an instrument which supports text 
production (Aarts 99). Word definitions are simpler and might be supported by pictures. Carefully selected examples 
are very important for the illustration of the meaning of words, as well as typical lexico-grammatical patterns, etc. 
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Figure 1 shows the entry of the word „Fenster“ (English window ) in Langenscheidt's learners' dictionary 
„Deutsch als Fremdsprache“. In a printed dictionary space is limited, information is structured in a linear manner, 
and a lot of abbreviations are used. Therefore it is difficult to get an overview of the provided information, to 
understand which information is provided in detail, and where particular information can be found. Moreover, it is 
impossible to directly access semantically related words, except derivatives which, according to their usual 
lexicographic order, are physically close to the word under consideration. 
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Figure 1: The entry for the German word “Fenster” in Langenscheidt’s 
learners’ dictionary “Deutsch als Fremdsprache” 



Modem hypermedia and multimedia technologies allow to address a lot of these problems by providing 
new ways for structuring and presenting complex information. An overview of English learners dictionaries 
available on CD-ROM is given in (Nesi 99). Cambridge Dictionaries Online provides web-access to a range of 
English learners' dictionaries (Harley 00). 



3. The ELDIT Dictionary 

At the European Academy Bolzano we are currently developing an electronic learners dictionary called 
ELDIT (Elektronisches Lem(er)w6rterbuch Deutsch ITalienisch). ELDIT can be used both as a monolingual and as 
a bilingual dictionary. Currently we are implementing ELDIT for the German and Italian languages, but the software 
could easily be adapted to different language pairs. The following sections describe the system in more detail by 
concentrating on different aspects of content and design. 

3.1 A Sample Dictionary Entry 

Figure 2 shows a screenshot of the dictionary entry for the German word "Haus" (English house). The 
information is presented in two different frames. The left-hand frame shows the lemma Haus , a loudspeaker- 
button which on activation plays a sound file with the pronunciation of the word, and different meanings of the 
word. The right-hand frame shows various pieces of information about the word, which are collected in different 
tabs. Some of these tabs can only be activated once a specific meaning of the word is selected. In figure 2 the first 
meaning of the word "Haus" is actually selected, and the right-hand frame shows the collocation tab, which contains 
a list of the most frequent collocations together with their translation and an illustrative example. ELDIT contains a 
large number of patterns for word usage (collocations, idiomatic expressions, constructions) in combination with a 
translation of the patterns as well as illustrative examples. These data are presented in the tabs „Verwendu 

According to Aitchinson (Aitchinson 94), people not only remember words in a possible context, but group 
the words in their minds into multidimensional word nets. Several word nets are visualized in ELDIT, among them 
the semantic field which groups words by synonymy, hyperonymy, antonymy, etc. and can be accessed by 
activating the tab „Verwandte Worter“. 

Chapelle stresses the importance of drawing the users’ attention to linguistic characteristics of a specific 
language (Chapelle 98). ELDIT uses footnotes to inform learners about particular differences between the two 
languages. Clicking on a footnote opens a small window, where linguistic difficulties are explained (see figure 2). A 
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summary of all difficulties related to a specific word is shown in the „N.B “ tab. For a more comprehensive 
description of these features in ELDIT see (Abel 00). 
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Figure 2: ELDIT screenshot for the German word “Haus” 



3.2 Vocabulary Coverage 

The ELDIT system should support learners of the German and the Italian language to prepare for the exam 
of bilingualism which is required for all employees at public institutions in South-Tyrol, a bilingual region in the 
North of Italy. For the selection of the vocabulary included in ELDIT the following considerations have been taken 
into account. The so-called basic vocabulary of a language consists of approximately 3,000 words and covers about 
95% of the words in a normal text (Lewandowski 90). For both the Italian and the German languages an intersection 
of different basic vocabularies has been built. The two resulting vocabularies have been adjusted to each other in 
order to avoid big differences between the two languages. Finally, the two vocabularies have been enlarged by some 
frequently used words in South-Tyrol such as "farmer", „wine“, etc. The resulting vocabularies cover standard 
German and Italian with focus on the language variants spoken in South-Tyrol. 



3.3 Searching the Dictionary 

One of the most obvious advantages of electronic media over print media is their possibility to provide fast 
and efficient search capabilities. In ELDIT, the user can search a lemma directly or by one of its declined or 
conjugated forms, e.g. „ging‘‘ leads to the lemma „gehen“. The learner can also search more complex expressions 
such as collocations and idiomatic expressions, e.g. „aus dem Fenster schauen", by simply typing the whole 
expression into the search field. Furthermore, the user is allowed to omit some problematic parts of an expression 
and to replace it by wildcards, e.g. „Geb*de e*ichten“ results in the collocation „ein Gebaude errichten". Finally, 
ELDIT can detect spelling errors in the search expression and provide alternative propositions. 

3.4 Data Model 

Figure 3 shows a simplified version of our basic data model represented in an extended entity-relationship 
notation with generalisation/specialisation. The main objects in our domain are word entries, which are further 
classified into nouns, verbs, adjectives, and structure words (articles, prepositions, etc.). Every word entry is 
associated with various pieces of information. A distinction is made between information concerning the word itself 
(mainly lexical information) and information concerning a particular meaning of the word (mainly semantic 
information). The former class includes the lemma (e.g. "Haus"), morphological information (a word completed 
with article and plural form, e.g. "das Haus, die Hauser"), derivations (e.g. "das Hauschen” and "hauslich"), 
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idiomatic expressions (e.g. "jemandem ins Haus platzen"), and footnotes. The latter class concerns information 
about a specific word meaning and includes compound words (e.g. "Hausfrau"), collocations (e.g. "in einem Haus 
wohnen"), a list of adjectives which typically go with this noun (e.g. "baufallig“, ,,renovierungsbedurftig“), 
constructions which describe the use of a verb together with a noun or a preposition, and a picture. Words will be 
grouped into associative and semantically related word fields. These fields will be visualized as interactive graphs, 
where nodes represent the words (word entries) and edges are annotated with information explaining the difference 
between related words. 




Most of the above mentioned pieces of information are complex units each of which might be composed of 
a definition, a pattern, a translation, examples, explanations, and comments. For instance, a collocation consists of a 
pattern, a translation, and an example (see figure 2). 

The basic data model will be extended by several new classes of information units: extensive grammar 
units will provide grammatical background; exercises, tests, and text material will allow the learner to practice the 
acquired vocabulary and to train both reading and writing skills (gray part in figure 3) 



4. Towards an Adaptive Vocabulary Acquisition System 

In this section we discuss two extensions of the basic ELDIT system: adaptation to individual users and 
systematic vocabulary acquisition. We are currently implementing these features, an evaluation remains future work. 

4.1 Adaptation of Information Presentation 

A great advantage of hypermedia systems is their ability to be combined with adaptation techniques. This 
allows to adapt the system to individual users and to provide each user with the content which best fits his/her needs 
and wishes. Various user features have been identified to which the content presentation of the ELDIT dictionary 
should be adapted. 

First of all, the user's language skills and knowledge should be considered. A novice should not be faced 
with too many different word meanings; collocations and difficulties concerning specific words are much more 
helpful. An advanced learner wishes a more comprehensive picture of the language and needs more word meanings 
as well as a precise definition thereof, free combinations, and lots of idiomatic expressions. 

The second idea of adaptation concerns language for special purposes. The current version of ELDIT 
covers standard German and Italian, which is sufficient in many cases. However, people in South Tyrol are more 
and more required to use the second language at work, which means that professional terminology becomes 
increasingly important. Therefore, ELDIT should support the learning of languages for special purposes, e.g. by 
adapting the examples to the individual user’s background, occupation, business, or interests. 

There are several meaningful ways to adapt the presentation of the information to user preferences. 
Different users might focus on different pieces of information when they access a new word. ELDIT can record each 
user’s actions. If certain preferences are observed, the system shows the preferred information immediately, e.g. if a 
user almost always listens to the pronunciation of a word, the sound-file can be activated automatically every time 
the user accesses a dictionary entry. 

Another user feature for adaptation is the possibility to use the ELDIT system both as a monolingual and as 
a bilingual dictionary. 
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4.2 Systematic Vocabulary Acquisition 

We will now describe our ideas concerning the extension of the basic dictionary to a tool for systematic 
vocabulary acquisition. In order to retain the acquired vocabulary, „the learner needs to have opportunities to 
produce target language output" (Chapelle 98). ELDIT will be enlarged with text units for studying and practicing 
the vocabulary. The words in the text units will be linked to the corresponding dictionary entry, and the vocabulary 
to be trained will be highlighted, as it was successfully done in Ridder's study about reading and incidental 
vocabulary acquisition (De Ridder 00). Each text contains a couple of questions. The learner has to answer these 
questions in his/her foreign language, which serves as an indication whether he/she has understood the text and 
whether he/she is able to use the vocabulary. 

Furthermore, all word entries in ELDIT will be categorized in various groups. Such groups are, for 
example, the words belonging to a particular text or a specific domain, semantically related words, or the associative 
field. The ELDIT system will then support the following approaches: 



• Vocabulary acquisition by word groups'. The user can study a group of words and afterwards practice 
the acquired knowledge with a text. 

• Vocabulary acquisition by texts'. The user can choose a text, read it, check the meaning of unknown 
vocabulary in this particular context, try to answer the questions, and finally systematically explore and 
memorize the meanings, the usage, and the context of the unknown vocabulary. 

• Repetition of vocabulary. The user can decide to repeat the formerly acquired vocabulary and practice 
it with a text. 

In every situation the user can ask the system to choose an appropriate vocabulary group or a text that 
matches best. When choosing a text the system takes into account the frequency of known and unknown words 
within the chosen text as well as additional data stored in the user model. 



5. Related Work 

A large number of CD-ROMs and web-based tools for language learning are available, but very often these 
tools are just one-to-one copies of printed textbooks. Only a few systems exploit the potential of modem hypermedia 
technologies in a meaningful way (Gamper et al. 01). Closest to our work are the vocabulary acquisition system 
CAVOCA (Groot 00), the dictionary Alexia (Selva et al. 97), and the adaptive language learning program CASTLE 
(Murphy et al. 97). 

ELDIT provides more possibilities to apply, practice, and repeat the acquired vocabulary than CAVOCA. 
With approximately 3000 words our system contains far more dictionary entries than the monolingual system 
Alexia, which contains about 200 word entries. Regarding the design, both Alexia and ELDIT visualize related 
words as interactive graphs. ELDIT is a bilingual system and covers two languages not yet covered by the other two 
systems. Furthermore, the inclusion of adaptive components is planned. 

CASTLE (Murphy et al. 97) is one of the few adaptive language learning programs. The traditional 
grammatical approach is combined with a more functional communicative approach. ELDIT mainly differs from 
this system in its systematic approach to vocabulary acquisition, communicative components are provided as short 
dialogues which are included as illustrative examples. 



6. Conclusion 

In this paper we presented an ongoing research project concerning the development of an adaptive 
vocabulary acquisition system. The system consists of a learners' dictionary which contains about 3,000 word entries 
for the Italian and the German languages. The design of the dictionary is motivated by modem psycholinguists 
theories - such as the mental lexicon - which help the learner to remember more easily the acquired information. 
The basic dictionary will be extended with texts and exercises, and a systematic vocabulary acquisition process 
based on individual user interests and guided by the system will be provided. 

New media are explored in various ways in order to support information access and presentation. Hypertext 
and multimedia features allow new ways for structuring, linking and presenting complex knowledge and 
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information. Word entries as well as the various pieces of information composing a word entry can be organized, 
connected, and grouped in a non-linear way, which brings different views to the world of words in a dictionary. The 
presentation of the information can be tailored to the individual user needs. Visual and acoustic elements support 
knowledge transmission. Electronic search features provide a fast and efficient access to the information, which is a 
very important issue in dictionaries. 

Future work includes the extension of the basic dictionary with text units and grammatical units as well as a 
systematic and comprehensive evaluation of the system. 
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